Trb06-0960 Effects of Sample Size on the Goodness-of-fit Statistic and Confidence Intervals of Crash Prediction Models Subjected to Low Sample Mean Values

نویسندگان

  • Ravi Agrawal
  • Dominique Lord
چکیده

The statistical relationship between motor vehicle crashes and covariates can generally be modeled via generalized linear models (GLMs) using logarithmic links with errors distributed in a Poisson or Poisson-gamma manner. The scaled deviance (SD) and Pearson’s X are tools that have been proposed to test statistical fit of GLMs. Recent studies have shown that these two estimators are not adequate for testing the goodnessof-fit (GOF) of GLMs when they are developed from data characterized with low sample mean values. To circumvent this problem, a testing method has been proposed to evaluate the goodness-of-fit of such GLMs. Given the fact that this method can be timeconsuming to implement, there is a need to determine whether this technique is sensitive to different sample sizes. The primary objective of this paper was to investigate the effects of decreasing sample sizes on the GOF testing technique. A secondary objective was to estimate how the reducing of sample size influences the confidence intervals of GLMs. In order to accomplish the objectives of the study, GLMs were fitted using two datasets subjected to average and low sample means collected in Toronto, Ontario. Several models were estimated for different sample sizes. The results of the study show that the testing technique is more effective for smaller samples than for larger samples when data is subjected to low sample mean values. The results also show that the width of the confidence intervals increases, as expected, as the sample size decreases, and can be extremely large for very small sample sizes. Hence, statistical models characterized by low sample mean values should be developed using a large number of observations. In fact, it is recommended to develop models using datasets containing at least 100 observations (e.g., intersections, segments, etc.). The paper concludes with recommendations for future studies involving such datasets.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Goodness-of-fit testing for accident models with low means.

The modeling of relationships between motor vehicle crashes and underlying factors has been investigated for more than three decades. Recently, many highway safety studies have documented the use of negative binomial (NB) regression models. On rare occasions, the Poisson model may be the only alternative especially when crash sample mean is low. Pearson's X(2) and the scaled deviance (G(2)) are...

متن کامل

Modeling motor vehicle crashes using Poisson-gamma models: examining the effects of low sample mean values and small sample size on the estimation of the fixed dispersion parameter.

There has been considerable research conducted on the development of statistical models for predicting crashes on highway facilities. Despite numerous advancements made for improving the estimation tools of statistical models, the most common probabilistic structure used for modeling motor vehicle crashes remains the traditional Poisson and Poisson-gamma (or Negative Binomial) distribution; whe...

متن کامل

An Empirical Evaluation of the Use of Fixed Cutoff Points in RMSEA Test Statistic in Structural Equation Models.

This article is an empirical evaluation of the choice of fixed cutoff points in assessing the root mean square error of approximation (RMSEA) test statistic as a measure of goodness-of-fit in Structural Equation Models. Using simulation data, the authors first examine whether there is any empirical evidence for the use of a universal cutoff, and then compare the practice of using the point esti...

متن کامل

Parameter Estimation in Astronomy with Poisson-distributed Data. Ii. the Modified Chi-square-gamma Statistic

I investigate the use of Pearson’s chi-square statistic, the Maximum Likelihood Ratio statistic for Poisson distributions, and the chi-square-gamma statistic (Mighell 1999, ApJ, 518, 380) for the determination of the goodness-of-fit between theoretical models and low-count Poisson-distributed data. I demonstrate that these statistics should not be used to determine the goodness-of-fit with data...

متن کامل

Effects of low sample mean values and small sample size on the estimation of the fixed dispersion parameter of Poisson-gamma models: A Bayesian Perspective

There has been considerable research conducted on the development of statistical models for predicting motor vehicle crashes on highway facilities. Many of these developments were performed for the likelihood-based or frequentist modeling approach. Over the last few years, there has been a significant increase in the application hierarchical Bayes method for modeling motor vehicle crashes. Whet...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006